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Exchange of data between components of distributed software 

HAVING DIFFERENT VERSIONS OF SOFTWARE 

FIELD OF THE INVENTION 

[0001] The present invention relates generally to distributed soft- 
ware systems, and particularly to co-operation between different components 
of distributed software having different versions of the software. 

BACKGROUND OF THE INVENTION 

[0002] A software system having software components installed 
apart from each other and exchanging data with each other is called a distrib- 
uted software system. An example of a distributed software system or distrib- 
uted computer system is a 'client-server' relationship in which the execution of 
a task is divided between different functional entities (i.e. computer processes 
or software components) which are specialized in different parts of the task so 
that one functional entity uses a service provided by another. The process re- 
questing the service is called a 'client' and the provider of the requested serv- 
ice is called a 'server'. Typically, the server software component(s) and the 
client software component(s) are installed on separate computers called server 
computers and client computers, respectively. Alternatively, the server soft- 
ware component(s) and the client software component(s) may be different 
software processes running in the same computer device. 

[0003] The components of distributed software are not necessarily 
updated all at once. Especially when the software is divided to server compo- 
nent(s) and to client component(s), the number of client component(s) installed 
tends to be high and simultaneous update of all of them is not possible. When 
client components are installed into portable computers, or computers other- 
wise behind possible slow-speed connections, updating the client component 
is not always an eligible option. 

[0004] U.S. patent 5,915,1 12 discloses a distributed computer sys- 
tem wherein a client computer process can request the execution of a remote 
procedure on a server computer process even when the server computer pro- 
cess does not support a current version of the client computer. The remote 
procedure requests are based on remote procedure calls (RPC's) which are 
typically defined using a remote procedure interface. U.S. patent 5,915,112 
proposes an extended remote procedure interface, which maps each remote 
procedure defined in the interface to prior versions of the remote procedure. 
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This enables client computer processes to utilize a version map from the in- 
terface when requesting the execution of remote procedures so that the re- 
quests are in a format supported by the server computer processes executing 
the remote procedures, regardless of the versions of the remote procedure 
interface in the client and server processes. In other words, U.S. patent 
5,915,112 discloses algorithms explicitly written to provide an adaptation be- 
tween different versions of the remote procedure interfaces of the client and 
server processes. These algorithms check the compatibility between the re- 
mote procedure interfaces separately for each RPC request, which is capacity- 
consuming since a significant number of RPC requests is typically involved 
with each user session. Further, such explicit algorithms are susceptible to er- 
rors and laborious to maintain. The susceptibility to errors is particularly a con- 
sequence of the fact that any change in a data structure does not propagate to 
the adaptation algorithms unless the programmer remembers to make the 
necessary changes in the algorithms. If the algorithms are not reprogrammed 
to take into account the change in the data structure, the system will be unsta- 
ble. In other words, a version mapping is manually written for previously known 
software versions. In practice, this involves quite a difficult binding of the algo- 
rithms and mapping to version-specific interface definitions. Finally, in the 
system according to U.S. patent 5,915,112, the client process must have 
means for being compatible with the server process. 

[0005] International patent application WO 00/75784 discloses soft- 
ware translation using metadata. In this prior art, too, the metadata is written 
for previously known versions so that the metadata as such is a map between 
the differences of these versions. Thus, the metadata must include explicit 
transformation definitions. This approach is laborious to implement in practice 
and a complicated solution that makes application development inflexible and 
does not support the rapid cycle development of a product that consists of 
separate components exchanging data with each other. 

A SUMMARY OF THE INVENTION 

[0006] An object of the invention is to provide a more reliable and 
flexible method for enabling different components of distributed software hav- 
ing different versions of the software to co-operate. 

[0007] The basic idea of the invention is to serialize data between 
different components of distributed software by using data exchange metadata 
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descriptions. Serialization is used in transferring data from one component to 
another. In other words, data is serialized at a sending end, transferred to a 
receiving end, and deserialized at the receiving end. In a preferred embodi- 
ment of the invention, serialization is an operation wherein data content of a 
data object is transformed into a stream of bits. The bit stream can be respec- 
tively deserialized in order to reconstruct the data content into a compatible 
data object at the receiving end. The data exchange metadata description of a 
software component version comprises information on definitions for data 
structures to be serialized of the specific software component version. A dedi- 
cated data exchange metadata description is produced for each software ver- 
sion that is delivered. 

[0008] A delivery version of a counterpart version is identified, pref- 
erably in the beginning of a data exchange, by the software component having 
a newer version. The newer version is responsible for loading the data ex- 
change metadata of the counterpart version and for using the data structures 
of the counterpart in both serialization and deserialization. Thus, the software 
component having a newer version is able to exchange data with any compati- 
ble counterpart software component by means of the data exchange metadata 
of the respective component in data serialization and deserialization. The older 
version does not need to take further measures for compatibility. As a result, 
the present invention enables reliable and convenient backward compatibility 
and true flexibility for updating the components of distributed software. 

[0009] An advantage of the present invention is that there is no 
need to write a version-specific mapping algorithm separately for each pair of 
known counterpart software versions in the manner required in the prior art. As 
a consequence, the need to maintain explicit mapping algorithms used in the 
prior art, as well as the associated problems, are also avoided. Thus, delivery 
of a new software version does not require a new mapping algorithm for each 
older version. Instead, in the present invention it is only necessary to provide, 
preferably automatically, a metadata description of the relevant data structures 
of the new software version. The actual mapping, called a serialization scheme 
herein, is then created, and preferably also verified, automatically between the 
new software version and any older counterpart software version by means of 
a universal algorithm and the metadata descriptions of these two component 
versions, either prior to delivery or subsequent to the delivery. The present in- 
vention also improves the overall performance because very few additional 
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steps are needed in the algorithms, and the data stream itself does not have to 
carry any version and/or structure information. 

[0010] In an embodiment of the invention, a delivery of a new ver- 
sion includes only the data exchange metadata description of that new version, 
and the data exchange metadata descriptions of compatible versions of the 
counterpart software component(s) are delivered later, for example in the data 
exchange session with the respective version. In another embodiment, the 
data exchange metadata descriptions for substantially all compatible versions 
of the counterpart software component(s) are delivered with each new version 
of a software component, in addition to the data exchange metadata descrip- 
tion of that new version. 

[0011] In an embodiment of the invention, the serialization scheme 
is preferably prepared and stored by the installed new software version when a 
data exchange with an older version of a counterpart component is carried out 
first time. In another embodiment, the serialization scheme can be prepared 
and stored automatically beforehand. In a yet further embodiment, the seriali- 
zation scheme is prepared for at least one pair of software versions prior to the 
delivery of a newer version, and delivered with the new software version in- 
stead of or in addition to the data exchange description(s). 

[0012] In the present invention, the data exchange metadata de- 
scription is tied to the delivery versions of software components thus enforcing 
compatibility between complete installed entities. This approach helps to iso- 
late version compatibility checking and to reduce the number of versions to 
take into account within a single data exchange operation. Version compatibil- 
ity can be checked in a most reliable manner using the data exchange meta- 
data of the delivery versions, assuming that the data structures used for data 
exchange match those used inside application programs. As an advantage of 
the relationship between the delivery version and the data exchange metadata, 
version compatibility checking is a simple, isolated operation. It usually occurs 
only once, in the beginning of a client component session, prior to data ex- 
change. 

[0013] In a client-server concept, the newer software component 
version is typically a server component and the older counterpart software 
component is a client component. The present invention enables reliable and 
convenient backward compatibility for components of distributed software 
without compromising performance. Client component updates are not neces- 
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sary immediately after a server component has been updated. Being able to 
use a client component of an older version improves usability as users then 
have a lot more flexibility on choosing when to upgrade the client components. 
It also allows accessing server components of different versions with a single 
client component. Another advantage of performing compatibility measures on 
the newer version side is being able to immediately use new compatibility 
measures also for any old versions. 

[0014] The serialization scheme may be built by request or during 
the first serialization of a given data object. Building a serialization scheme ac- 
cording to the invention involves analyzing the object including the definitions 
of all ancestors and referred objects. It is a relatively heavy operation but takes 
place only once. The invention provides fast communication since serialization 
operations are fast and straightforward as they only utilize the prepared seriali- 
zation schemes. Further, when only the essential primitive contents are serial- 
ized, the serialization performance increases and message size decreases 
accordingly. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0015] The invention will now be described in more detail by means 
of preferred embodiments of the invention and with reference to the attached 
drawings, in which: 

[0016] Figure 1 illustrates a client-server architecture; 

[0017] Figure 2 is a flow chart illustrating the delivery of a new soft- 
ware component according to an embodiment of the present invention; 

[0018] Figures 3A and 3B are flow charts illustrating an algorithm 
for writing a metadata description according to an embodiment of the invention; 

[0019] Figures 4A and 4B are flow charts illustrating an algorithm 
for compatibility verification between different software versions according to 
the present invention; 

[0020] Figure 5 is a block diagram illustrating data exchange be- 
tween an older version of a client component and a newer version of a server 
component; 

[0021] Figure 6 is a flow chart illustrating the general steps of pre- 
paring serialization, and serialization and deserialization, 

[0022] Figure 7 is a flow chart illustrating the process for preparing 
the serialization, 
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[0023] Figures 8A and 8B are flow charts illustrating the serializa- 
tion operation, and 

[0024] Figures 9A and 9B are flow charts illustrating the deseriali- 
zation operation. 

PREFERRED EMBODIMENTS OF THE INVENTION 

[0025] The present invention can be applied to any distributed soft- 
ware system, in which different components of distributed software having dif- 
ferent versions of the software must co-operate, i.e. exchange data with each 
other. An example of a distributed software system is a server-client concept 
which is illustrated in Figure 1. In the server-client architecture, the software is 
divided into server component(s) 5 and client component(s) 6 and 7 installed in 
server computer(s) 1 and client computers) 2 and 3, respectively, apart from 
each other. Alternatively, the server software component(s) and the client 
software component(s) may be different software processes running in the 
same computer device. The server components and the client components 
communicate with each other via any appropriate communication medium 4 
depending on implementation. The type of the communication medium is not 
relevant to the present invention. Examples of suitable communication media 
include various communication networks, communication links, a storage me- 
dium, and a shared memory. The invention will be now described by using the 
client-server architecture as an example, but the present invention is not re- 
stricted to client-server systems. 

[0026] Firstly, concepts of serialization, deserialization, struc- 
tural data, and a metadata description are briefly explained in order to facilitate 
understanding the invention. 

Serialization and deserialization 

[0027] Serialization is an operation used in transferring data from 
one component to another. That is, data is serialized at the sending end, 
transferred to a receiving end and then deserialized at the receiving end. Seri- 
alization is the operation where the data content of a data object is transformed 
into a stream of bits that may be formatted into bytes. The bit/byte stream can 
be respectively deserialized in order to reconstruct the data content into a 
compatible data object. The format of the bit/byte stream can be simplified 
and/or compressed in order to improve performance or it can be a tagged text 
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format, such as XML, in order to make it readable to different systems or by 
humans. 

[0028] In an embodiment of the invention, not only actual data, but 
also information about a source data object can be included into serialized 
data in order to support the checking of compatibility against a target data ob- 
ject or just to help examine the byte stream. If data is recursive or if the data 
structures are known only at run time, they can be identified in the byte stream 
and allocated respectively at deserialization. In order to prevent serializing the 
same data multiple times, each data structure instance may be assigned a 
unique identifier in the complete byte stream context. When a data structure 
instance is encountered again during the same serialization or deserialization 
operation, only a reference to it can be processed, instead of processing the 
instance again. 

Structural data 

[0029] When a data item contains more than a single value (a num- 
ber, a string or a fixed binary block), data transfer is easier to manage by proc- 
essing it as a structural entity rather than transferring each value separately. A 
structure can be simply a fixed set of values, each having a known data type. 
Serializing such structures is quite a simple operation but has such limited 
characteristics that they are not very useful. 

[0030] Common, more advanced characteristics for structural enti- 
ties include: 

[0031] - Defining structures as a set of items where an item can be 
defined to have a simple or structural type. Structures can be nested inside 
each other. 

[0032] - Defining items optionally as arrays. An array may have a 
fixed size or a variable size and a limit for a maximum number of elements. 
The array implementation may be limited to only one dimension or it may sup- 
port multiple dimensions. 

[0033] - Defining some or all data items optional so that instead of 
always containing a value, the data item may contain just an indication of a 
missing value. 

[0034] - Defining some data items to be alternative or optional ac- 
cording to a separate discriminant value. In a structure containing a set of al- 
ternative items (such as a union of the programming language C), the discrimi- 
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nant value determines which item is used. The discriminant value can also be 
for example a bit mask where each bit determines the existence of the respec- 
tive item. 

[0035] - Defining the structures as classes in an object-oriented de- 
sign, thus having the definitions of a possible super class acknowledged for 
each class. 

[0036] - A loose-type relationship between declared data items and 
data items actually passed. The data item could simply refer to any structural 
entity. In an object-oriented design, an instance of a subclass could be passed 
instead of an instance of the declared class. 

[0037] - A combination of the above features can be used to pass 
for example lists with elements of different types or recursively nested struc- 
tures. 

Data exchange metadata 

[0038] The serialization of structural data requires access to infor- 
mation on definitions of the structures to serialize. With the assumption that 
serialization is mainly used for exchanging data, that information is referred to 
here as data exchange metadata. 

Producing new software version 

[0039] Referring now to Figure 2, let us assume that a new version 
of distributed software is produced (step 200). Quite often an update to a dis- 
tributed software system means adding a new functionality and retaining an 
existing functionality. In such case, it is likely that the client components of the 
previous software version are functionally compatible with the new version of a 
server component but the data structures exchanged between the components 
may have changed. Thus, in many cases where the new functionality of the 
server component is compatible with the functionality of the old client compo- 
nents, with the exception of modifications to the data structures, the new 
server component can co-operate with such old client components assuming 
that the server component fully accepts the data structures and items in the 
way they were defined for the client component, i.e. the server component 
must have available the data exchange metadata descriptions according to the 
invention. 

[0040] Thus, for example, if the data exchange metadata descrip- 
tion is produced for each software version that is delivered, and the data ex- 
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change metadata descriptions of all compatible versions are delivered with 
each software version, the server component is able to exchange data using 
the data exchange metadata of any compatible client component. 

[0041] However, in order to maintain backward compatibility, all the 
services offered and data structures used with previous versions must be kept 
available. The data structures can be modified but the data items must be 
found by using old names, and the types of the data items must be kept back- 
ward compatible (convertible). For any removed or added item, there must be 
a way to determine a default value. In case there is any need for renaming the 
data structures or the data items in them, the name changes must be declared 
to complement the metadata information. The declaration is necessary so as to 
find the matching structure or item using the old name. 

Producing a data exchange metadata description 

[0042] In accordance with the present invention, data exchange 
metadata descriptions are produced both for the components of the new soft- 
ware version (step 201) and for all older software versions which should be 
compatible (step 202). There are various ways to produce the metadata de- 
scriptions. 

[0043] In applications using serialization, the data structures used in 
the programs are usually similar to those used in the serialization. In such a 
case, the data exchange metadata is most often based on a common source 
code for both uses. When the applications sharing serialized data are separate 
from each other, and possibly not implemented using the same programming 
or scripting language, the data exchange metadata can be described using a 
tool or language (e.g. CORBA IDL, ISO/lEC 14750) dedicated to describing 
such metadata. When using a programming language that provides access to 
the metadata of language objects, the source of the data exchange metadata 
can be simply a part of the program source code. 

[0044] If the data exchange metadata is deduced from type defini- 
tions of a program source code, the definitions should be distinguishable from 
other definitions. If they are not distinguished, that information is difficult to 
manage and it does not help in the verification of compatibility between soft- 
ware versions. In practice, the difficulty in managing this information means 
scanning it multiple times during the serialization and deserialization opera- 
tions, thus severely compromising performance. 



9 



First Filing Application of ARO et al., atty. dkt. 290636 



[0045] The data exchange metadata must have a defined descrip- 
tion format in order to be able to permanently store it and to restore it to the 
software. The details of the format are not significant. 

[0046] An example of an algorithm for writing a metadata descrip- 
tion according to an embodiment of the invention will now be described with 
reference to Figures 3A and 3B. The metadata description reflects the data 
type description information required for a backward compatible serialization. 
The syntax of data can be freely chosen. The preferred source of metadata 
information is the definition of the data types in terms of the programming lan- 
guage. This is possible only with programming languages providing access to 
the type definition. Firstly, in step 300, information about the delivery version of 
the new software component is recorded to the metadata description. It should 
be noted that in Figures 3A and 3B, the dotted lines represent writing opera- 
tions to the metadata description 31 . Then, a collection 30 of data types to be 
serialized is accessed in step 301. If a language providing access to the type 
definitions is used, the collection 30 can be simply a list of references to those 
definitions. Steps 302 to 313 are then performed for each data type to be seri- 
alized. Firstly, the name and structure information is recorded to the metadata 
description 31 (step 303). Then, it is checked (step 304) whether the data type 
is structural or not. If the data type is not structural, the process proceeds di- 
rectly to step 313 and then returns to step 302 to handle the next data type. 
However, if the data type is structural, methods for converting the respective 
data type to and from another data type are recorded to the metadata descrip- 
tion 31 for each data type found compatible (steps 305, 306 and 307). Then, it 
is checked for each field of the data type whether a default value is accessible 
(steps 308 and 309). If a default value is accessible, the access to the default 
value is recorded to the metadata description 31 (step 31 0), whereafter the 
process continues by recording the name and type information of the field to 
the metadata description 31 (step 31 1). If the default value is not accessible in 
step 309, the process moves directly to step 31 1 . When all fields of the data 
type have been processed in step 312, the next data type is processed via 
step 31 1. When all data types have been processed in step 313, the metadata 
description 31 is terminated in step 314. 
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Compatibility verification 

[0047] After the data exchange metadata descriptions have been 
provided, the compatibility between different software versions may be verified 
(step 203 in Figure 2). Compatibility verification can be performed as a part of 
the development work in order to know if the given versions have serialization 
compatibility. Preferably, compatibility verification is performed using an auto- 
matic verification algorithm when preparing the version delivery. Version com- 
patibility can be checked in a most reliable manner by using the data exchange 
metadata of the delivery versions, assuming that the data structures used for 
the data exchange match those used inside the application programs. The 
verification algorithm is executed against the data exchange metadata descrip- 
tions of previous deliveries as far as compatibility is considered necessary. The 
verification algorithm loads in the data exchange metadata for the current ver- 
sion and for the version to compare it with. The algorithm compares the name 
and type information of data structures and makes sure that all compatible 
structures still exist, that data items with compatible types exist and that for 
each removed and added data item, there is a way to determine a default 
value. 

[0048] An example of the verification algorithm is shown in Figures 
4A and 4B. In this example, when the verification is performed in order to seri- 
alize data between different versions, the exemplary algorithm also prepares 
access to the data items in the host language. The details of such preparation 
depend on the programming language used. 

[0049] The algorithm of Figures 4A and 4B illustrates steps for veri- 
fying compatibility for one structural type, i.e. it must be repeated separately for 
each different structural type. Firstly, the metadata description 401 for the old 
structural type (of the old version) and the metadata description 302 for the 
new structural type (of the new software version) are inputted (loaded) to the 
verification algorithm. Then, for each field of the newer version of the structural 
type, it is checked whether the same field can be found in the older structural 
version (steps 303, 304 and 305). If the specific field is not found in the older 
version, i.e. a new field has been added to the new version, it is checked 
whether a default value for such a field is accessible (steps 306 and 307). If 
such a value is not accessible, an error message is generated indicating that a 
default value is required for the new field. Upon receiving the error message 
from the algorithm, the software developer can define an appropriate default 
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value which is preferably stored to the metadata description of the newer ver- 
sion. If the default value is accessible in step 307, the access to the default 
value is recorded in step 309, and the algorithm proceeds to step 310. If the 
field of the newer version can be found in the older version in step 305, the 
algorithm proceeds directly to step 310. 

[0050] In step 310, it is checked whether the field types are identical 
in the newer and older versions. If the check is affirmative, the access to the 
field (a reflection reference or a structure offset) is recorded in step 31 1 . How- 
ever, if the field types are not identical, it is checked whether the field type of 
the older version is compatible with the field type of the newer version (step 
312), and the algorithm proceeds to step 311. However, if the field types are 
not compatible in step 312, the algorithm generates an error message indicat- 
ing that the field types must be compatible (convertible). Upon receiving the 
error message, the program developer can perform an appropriate measure to 
remove the error situation, if possible. 

[0051] After step 31 1 , it is checked whether there are more fields in 
the newer version to be verified (step 315). If yes, the algorithm returns to step 
303. If not, the algorithm proceeds to step 316 to verify each field of the older 
structural-type version. Firstly, it is checked whether the field of the older ver- 
sion can be found in the newer version (steps 317 and 318). If the same field 
cannot be found in the newer version, it is checked whether a default value for 
this field is accessible (steps 319 and 320). If a default value is accessible, the 
access to the default value is recorded (step 321), and the algorithm proceeds 
to step 323. However, if a default value cannot be accessed, the algorithm 
generates an error message indicating that a default value required for the re- 
moved field is missing (step 322). Upon receiving the error message (step 
322), the software developer can carry out an appropriate measure to remove 
the error situation. After step 380, it is checked whether there are more fields 
to verify in the older structural version. If there are more fields, the algorithm 
returns to step 316. If all fields have been verified, the compatibility verification 
has been completed. The serialization algorithm results in a serialization 
scheme 424, as will be explained in more detail below. This serialization 
scheme can be delivered with the new software version, instead of or in addi- 
tion to the data exchange metadata description(s) of the newer version, and 
optionally the metadata description(s) of the older version(s). 
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[0052] The same verification algorithm can also be used when pre- 
paring the serialization process in an installed server component on the basis 
of the metadata descriptions. However, when the algorithm is used for prepar- 
ing the serialization, no error messages should be generated, since any error 
situation has been detected and corrected already during the verification prior 
to the delivery of the new software version. 

[0053] Thus, this aspect of the present invention enables version 
compatibility to be checked automatically in a very reliable manner by using 
the data exchange metadata of the delivery versions. Most of the error situa- 
tions can be detected and corrected before the delivery. 

Delivery of new software 

[0054] The serialization scheme(s) and the data exchange meta- 
data description of the new software version and of compatible software ver- 
sions can be delivered with the specific software version (step 204), in which 
case they are stored in the server computer 1 during the installation of the new 
server component 5. However, especially if the data exchange metadata of the 
older version comprises a relatively small amount of information, it can be al- 
ternatively passed in a session handshake procedure (instead of delivering all 
the data exchange metadata descriptions or serialization schemes with each 
new version), thus determining the compatibility at run time. The data ex- 
change metadata may be obtained from the counterpart software component, 
or it may be retrieved from a centralized database. 

Data exchange using serialization and deserialization with metadata de- 
scriptions 

[0055] Figure 5 is a block diagram illustrating a data exchange ses- 
sion between an older client component (version v1) and a newer server com- 
ponent (version v2). In the exemplary configuration shown in Figure 5, sepa- 
rate serialization layers 500 and 501 are also shown. Implementation of the 
serialization layers 501 and 502 for a remote service calling typically necessi- 
tates that the remote call API (Application Programming Interface) is kept 
backward compatible and that the serialization layers support the version of 
the local and remote component. In addition to a serialization version- 
compatibility verification described below, a call API compatibility verification 
may also be used. 
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[0056] In the embodiment shown in Figure 5, the software compo- 
nent having the newer version (i.e. the server component 5) is responsible for 
all measures needed for achieving compatibility with the older version (i.e. the 
client component). To this end, the serialization layer 502 of the server compo- 
nent 5 includes appropriate operations, such as a version check 503 and ver- 
sion conversions 504. The client component of the older version v1 does not 
need to take any additional measures for compatibility, and therefore, the seri- 
alization layer 501 does not contain any extra operations for achieving com- 
patibility with the newer version v2 of the server component 5. However, it is 
possible that in addition to backward compatibility, forward compatibility is 
supported by similar means. For example, an older server component may not 
offer all services that a newer version of a client component expects. Thus, the 
serialization layer 501 of the client component 6 may contain operations for 
achieving compatibility with such an older version of a server component. 

[0057] The advantage of performing backward compatible serializa- 
tion only in a more advanced counterpart of the data exchange is that the client 
component that needs to co-operate only with the server components of the 
same version or a later version does not need any modification or updating for 
achieving serialization compatibility. Another advantage of performing com- 
patibility measures on the newer version side is being able to immediately util- 
ize new features for communication with old versions that do have these fea- 
tures. 

Version check 

[0058] Information about the delivery version of the software com- 
ponent must be passed in the beginning of the data exchange between the 
client and server components 5 and 6. For unidirectional transfer, the version 
is simply identified in the beginning of the byte stream. For bi-directional mes- 
saging, the first contact 505 within a data transfer session could be a hand- 
shake procedure where the version identifier is passed and the software com- 
ponents 5 and 6 get to know if and how they are compatible. Optionally, this 
procedure may also support client version updating whenever it is considered 
preferable. 

Preparing serialization 

[0059] Referring to Figure 6, serialization is prepared by verifying 
the compatibility and initiating a serialization scheme (step 601). Serialization 
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can be prepared either by a single operation concerning all possible serialized 
data types, or by separate operations during the serialization of each data 
type. When this preparation is launched in order to verify the compatibility of 
the data types, it should be performed for all types to be serialized. 

[0060] Figure 7 is a flow diagram which illustrates the process for 
preparing the serialization. The server component 5 having the newer version 
v2 loads the version identifier of the local version (the version v2) and the re- 
mote counterpart version (the version v1). Then the version check procedure 
503 of the server component 5 determines whether the remote version is the 
same as the local version (step 702). If the versions are the same, a direct se- 
rialization can be used and no further preparation of the serialization is needed 
(step 703). However, if the remote version and the local version (e.g. the client 
component version v1 and the server component version v2) are not the same, 
it is checked whether the remote version (e.g. version v2) is already known by 
the server component 5 (step 704). If the remote version is already known, i.e. 
a serialization scheme has already been initialized for this remote version 
(prior to a delivery or during a previous communication), this earlier initialized 
serialization scheme is used (step 705) and the preparation of the serialization 
is ended. However, if the remote version is not known, compatibility is verified 
and a serialization scheme is initialized for this version (step 706). After step 
706, the preparation of the serialization is ended. 

[0061] The verifying and initialization step 706 can be carried out by 
the algorithm described above with reference to Figures 4A and 4B. When 
compatibility has been successfully verified, the algorithm results in an initial- 
ized serialization scheme 424. The serialization scheme contains information 
needed in the version v2 of the server component 5 for the serialization of data 
to and the deserialization of data from the client component 6 running the older 
version v1 . The serialization scheme is typically prepared only once for each 
type of a certain target version and stored in memory. All types of a certain 
target version can be prepared at once. 

Serialization and deserialization 

[0062] Referring again to Figure 6: after the serialization has been 
prepared, the serialization and deserialization of the data items are performed 
during the data exchange (step 602). The serialization and deserialization op- 
erations are needed for each data item to be transferred. For remote services, 
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the data items are a remote service parameter in the request message 506 
and the return value(s) in the reply message 507 in Figure 5. An example of a 
serialization process is illustrated by the flow chart shown in Figures 8A and 
8B. The serialization is started by inputting the data item 801 to be serialized 
(received from the server component 5) as well as the target version identifier 
802 indicating the version of the software with which the serialized data should 
be compatible. Then, it is checked whether the type of the data item 801 is 
structural or not (steps 803 and 804). If the type is not structural, the data item 
is serialized (step 805) and written out to the serialized byte stream 800, 
thereby completing the serialization of the specific data item. If the type of the 
data item 801 is structural, a serialization scheme for the type of the data item 
801 is retrieved (806) from a serialization scheme memory 807. It is checked 
for each field of the target version v1 in the retrieved serialization scheme 
whether the respective field can be found in the current version v2 (steps 808 
and 809). If a field cannot be found in the current version, a default value is 
kept for the specific field (step 810), and the process proceeds to serialize the 
data item 801 according to the target type (step 814) and to write out the field 
into the serialized byte stream 800. However, if the field can be found in the 
current version v2, the field value is obtained from the data item 801. Then, it is 
checked whether the types of the data items are identical in the two versions 
v1 and v2 (step 812). If not, a conversion of the type is carried out (step 813), 
and the resulting data item is serialized in step 814. If the types of the data 
items are identical in the two different versions v1 and v2, the data item is seri- 
alized in step 814. Then, in step 815, it is checked whether all fields of the tar- 
get version v1 have been processed. If not, the process returns to step 808. If 
all fields have been processed, the serialization of the data item 801 has been 
completed. As a result, a serialized byte stream 800 is transferred from the 
server component 5 to the client component 6. 

[0063] An example of a deserialization process is illustrated by a 
flow chart shown in Figures 9A and 9B. Firstly, an (uninitialized) data item 901 
and source version identifier 902 are inputted to the deserialization process. 
Then the data type is obtained from the inputted data item 901 or read from a 
serialized byte stream 903 received from the client component 6 (step 904). 
Then, it is checked whether the type of the data item is structural (step 905). If 
the data type is not structural, the serialized byte stream is read in and the data 
item is deserialized (step 906). However, if the data type is structural, a seriali- 
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zation scheme for the specific type of the data item is loaded from the seriali- 
zation scheme memory 907 (step 908) Then, it is determined for each field of 
the source version v1 in the serialization scheme whether the specific field is in 
the current version v2 or not (steps 909 and 910). If the specific field cannot be 
found in the current version v2, a value is set for the specific field in the data 
item (step 914). If the specific field is found in the current version v2, the seri- 
alized byte stream 903 is read in and the data item is deserialized according to 
the source type (step 91 1). Then it is checked whether the types of the data 
items are identical in the two versions v1 and v2 (step 912). If not, a conver- 
sion of the type is carried out (step 913), and the process continues to step 
914. If the types are identical in the two versions v1 and v2, the process pro- 
ceeds directly to step 914. After step 914, it is checked whether all fields of the 
source version in the serialization scheme have been processed (step 915). If 
not, the process returns to step 909. If all fields in the serialization scheme 
have been checked, the process is arranged to set values to fields that are 
only in the current version v2 by accessing respective default values (step 
916). 

[0064] The implementation of the serialization may have a signifi- 
cant influence in the performance of the software, because in addition to the 
resources consumed in the serialization and deserialization operations, the 
implementation also determines the size of the data stream exchange. The 
influence depends on how much the operations involve data exchange. 

[0065] The key to a high performance is serializing only the essen- 
tial data and nothing more. When data exchange metadata including all com- 
patibility considerations is already known by all concerned components, only a 
short identifier needs to be serialized for each data structure instance instead 
of the complete structure of the metadata. As the version identifier is passed in 
the beginning of the contact, it does not need to be serialized. 

[0066] Whenever possible, the data structures should be assigned 
permanent numeric identifiers that can be used to identify them instead of us- 
ing names as identifiers. Using numerical identifiers saves resources espe- 
cially in deserialization and reduces the length of serialized data. Whatever 
preparation the serialization or deserialization may require for each passed 
data structure, such preparation is preferably performed only once. The seriali- 
zation and deserialization operations should preferably minimize the time con- 
sumed for anything else than for the byte stream conversions. 
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An example 

[0067] In the following, simplified examples of the metadata de- 
scriptions and serialization schemes are shown. In this simplified example, an 
application written in Java serializes a few structural data items (objects). The 
metadata description format in the example is XML. 

[0068] The version 1 (v1) of the software has three classes to seri- 
alize and the version 2 (v2) has one additional class. Two of the classes have 
one new field in the version 2 in comparison with the version 1 . 

Version 1 classes 



[0069] Customer class 

package com. stonesoft . sample; 
public class Customer 

{String name; 

String address; 



[0070] Order class 

package com. stonesoft . sample; 
public class Order 

Customer customer; 
Product product ; 



[0071] Product class 

package com. stonesoft . sample; 

public class Product 

{ 

String reference; 
int priceCategory; 



Version 1 metadata description 

[0072] This description can be produced using the algorithm shown 
in Figures 3A and 3 B. 

<METADATA_DESCRI PTION ver S ion= " vl " > 

<TYPE name=" Customer" structural="yes "> 

<FIELD name="name" type="native : String" /> 
< FIELD name="address" type="native: string" /> 
</TYPE> 

<TYPE name="Order" structural="yes"> 

<FIELD name="customer" type="custom:Customer"/> 

<FIELD name= "product" type="custom: Product " /> 
</TYPE> 

<TYPE name="Product" structural="yes"> 

<FIELD name="name" type="native: String" /> 

<FIELD name="reference" type="native : String"/> 

<FIELD name="priceCategory" type="primitive:int"/> 
</TYPE> 
< /METADATA_DESCRI PT ION> 
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Version 2 classes 

[0073] The Address class has a special constructor and toString() 
method for enabling compatibility with the String type. 

[0074] Address class 

package com. stonesoft . sample; 

public class Address 
{ 

public static final int POSTAL_CODE_LENGTH = 5; // Constant; not serialized 

String streetAddress; 

String postalCode; 

String city; 

String country; 

/** 

* <P>Constructor for supporting conversion from address as String. 

* <P>The parts of address are separated according to finding the 

* postal code. 
*/ 

public Address (String pAddress) 

{ 

int postalCodeOf f set = 0; 

int offset = 0; 

int postalCodeLength = 0; 

boolean isPostalCodeFound = false; 

while ( (! isPostalCodeFound) && 

(offset < (pAddress. length () - POSTAL_CODE_LENGTH) ) ) 

i 

char currentChar = pAddress. charAt (offset) ; 

if (Character . isSpace (currentChar) ) 

{ 

postalCodeOffset = offset + 1; 

} 

if ( (postalCodeOffset != 0) && (Character .isDigit (currentChar) ) ) 
{ 

while ( (postalCodeLength < POSTAL_CODE_LENGTH) && 
(Character . isDigit (pAddress .charAt ( 

offset + postalCodeLength) ) ) ) 

{ 
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postalCodeLength++ ; 

} 

isPostalCodeFound = (postalCodeLength == OSTAL_CODE_LEHGTH) 

} 

of fset++; 

} 

if (isPostalCodeFound) 
{ 

while ( (postalCodeLength < (POSTAL_CODE_LENGTH * 2) ) && 
(! Character .isSpace (pAddress . charAt ( 

postalCodeOf f set + postalCodeLength) ) ) 

{ 

postalCodeLength++; 

> 

streetAddress = pAddress . substring (0 , postalCodeOf f set ). trim( ) ; 
postalCode = pAddress . substring (postalCodeOf f set, 

postalCodeOf f set + postalCodeLength) . trim ( ) 
int countryOffset = pAddress . indexOf (" \n", postalCodeOf f set ) ; 
int cityOffset = postalCodeOf f set + postalCodeLength; 
if (countryOffset == -1) 
{ 

city = pAddress. substring (cityOf fset) .trimO ; 
country = '*"; 



city = pAddress. substring (cityOffset, countryOffset) .trimO 
country = pAddress . substring (countryOffset) . trim () ; 



streetAddress 
postalCode = 1 
city = ""; 
country = ""; 



: Default constructor. 



public Address! String pStreetAddress, 

String pPostalCode, 
String pCity, 
String pCountry) 

{ 

streetAddress = pStreetAddress; 
postalCode = pPostalCode; 
city = pCity; 
country = pCountry; 

} 



* Conversion to string from internal format. 
*/ 

public String toString ( ) 

( 

return streetAddress + "\n" + postalCode • 

} 
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[0075] Customer class 

package com. stonesoft . sample; 

public class Customer 

{ 

Address address; 
int priority; 

/** 

* Default value for priority field, this method is used by convention. 
*/ 

public int getDef ault_priority ( ) 



[0076] Order class 

package com. stonesoft . sample; 

public class Order 
{ 

Customer customer; 
Product product; 
int quantity; 

/** 

* Default value for quantity field, this method is used by convention. 

*/ 

public int getDef ault_quantity () 

{ 

} 



[0077] Product class 

package com. stonesoft. sample; 

public class Product 
{ 

String reference; 
int priceCategory; 
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Version 2 metadata description 

[0078] This description is produced using the algorithm presented in 
Figures 3A and 3B. 

<METADAT A_DES CRI PTION ve r s ion= " v2 " > 

<TYPE name="Address" structural='*yes"> 
<COMPAT I BLE_TYPE name= " S t r ing " > 

<CONVERT_TO by_method="toString"/> 
<CONVERT_FROM by_constructor /> 
</COMPATIBLE_TYPE> 

<FIELD name="streetAddress" type="native : String" /> 
<FIELD name="postalCode" type="native : String" /> 
<FIELD name="City" type="native : String" /> 
<FIELD name="Country" type="native : String"/> 
</TYPE> 

<TYPE name="Customer" structural="ye5"> 

<FIELD name="name" type="native : String" /> 

<FIELD name="address" type="custom: Address"/> 

<FIELD name="priority" type="primitive : int"> 

<DE FAULT_VALUE by_method="getDef ault_priority"/> 

</FIELD> 
</TYPE> 

<TYPE name="Order" structural="yes"> 

< FIELD name="customer" type="custom: Customer"/> 

<FIELD name="product" type="custom: Product"/> 

<FIELD name="quantity" type="primitive : int"> 

<DEFAULT_VALUE by_method="getDe f ault_quantity" /> 

</FIELD> 
</TYPE> 

<TYPE name="Product" structural="yes"> 

<FIELD name="name" type="native : String"/> 

<FIELD name="ref erence" type="native : String"/> 

<FIELD name="priceCategory" type="primitive : int"/> 
</TYPE> 
< / MET ADATA_DES CRI PT I ON> 

Serialization scheme 

[0079] This serialization scheme example describes the information 
needed in a component running version 2 for serializing data to and deserial- 
izing data from a component running version 1 

[0080] As a result of serialization preparation (see Figure 7 and Fig- 
ures 4A and 4B), the serialization scheme used in serialization (see Figures 8A 
and 8B) and deserialization operations (see Figures 9A and 9B), contains the 
following information: 
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• [0081] Customer; Structural, updated: 

• name; String, identical. 

• address; Was String, now Address, conversions: toString() method & constructor. 

• priority; int, new field, default value by getDefault_priority() method. 

• [0082] Order; Structural, updated: 

• customer; String, identical. 

• product; String, identical. 

quantity; int, new field, default value by getDefault_quantity() method. 

• [0083] Product; Structural, identical: 

• name; String, identical. 

• reference; String, identical. 

• priceCategory; int, identical. 

[0084] Even though the invention is described above with reference 
to the examples, it should be appreciated that the invention is not restricted to 
these examples but can be modified within the scope of the inventive idea dis- 
closed above and in the appended claims. 
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